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A re-analysis of the NMC and SLAC data leads to a great improvement in our knowledge of the 
valence d and u parton distribution functions (PDF's) at high x. Standard parton distributions 
with our modifications are in good agreement with QCD predictions for d/u at x = 1, and with the 
CDHSW up and Vp data, the HERA charged current cross section data, the collider high-P t jet data, 
and the CDF W asymmetry data. With the inclusion of target mass and higher twist corrections, 
the modified PDF's also describe all DIS data up to x — 0.98 and down to Q =1 GeV . 
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Recent work on parton distributions functions (PDF's) 
in the nucleon has focussed on probing the sea and gluon 
distribution at small x. The valence quarks distribution 
has been thought to be relatively well understood. How- 
ever, the precise knowledge of the u and d quark distri- 
bution at high x is very important at collider energies in 
searches for signals for new physics at high Q 2 . In addi- 
tion, the value of d/u as x — > 1 is of theoretical interest. 
Recently, a proposed CTEQ toy model W included the 
possibility of an additional contribution to the u quark 
distribution (beyond x > 0.75) as an explanation for both 
the initial HERA high Q 2 anomaly S, and for the jet ex- 
cess at high-P t at CDF ||. In this Letter we conclude 
that a re-analysis of data from NMC and SLAC leads to 
a great improvement in our knowledge of PDF's at high 
x, and rules out such toy models. 

Information about valence quarks originates from the 
proton and neutron structure function data. The u va- 
lence quark distribution at high x is relatively well con- 
strained by the proton structure function F$. However, 
the neutron structure function F^ ', which is sensitive to 
the d valence quark at high x, is actually extracted from 
deuteron data. Therefore, there is an uncertainty in the 
d valence quark distribution from the corrections for nu- 
clear binding effects in the deuteron. In past extrac- 
tions of F£ from deuteron data, only Fermi motion cor- 
rections were considered, and other binding effects were 
assumed to be negligible. Recently, the corrections for 
nuclear binding effects in the deuteron, F^/F^ , have 
been extracted empirically from fits to the nuclear depen- 
dence of electron scattering data from SLAC experiments 
E139/140 Q. The empirical extraction uses a model pro- 
posed by Frankfurt and Strikman ||, in which all bind- 
ing effects in the deuteron and heavy nuclear targets are 
assumed to scale with the nuclear density. The total cor- 
rection for nuclear binding effects in the deuteron (shown 
in Fig. |l|(a)), is in a direction which is opposite to what is 
expected from the previous models which included only 
the Fermi motion effects. The suprisingly large correc- 
tion extracted in this empirical way maybe controversial, 
but is smaller than the recent theoretical prediction || 
(dashed line in Fig. |l|(a)) 

The ratio F^/F^ is directly related to d/u. In leading 
order QCD, 2F 2 i /F 2 p - 1 ~ (1 + Ad/u)/ (4 + d/u) at high 
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FIG. 1. (a) The total correction for nuclear effects (binding 
and Fermi motion) in the deuteron, F$ /F^ v , as a function of 
x, extracted from fits to the nuclear dependence of SLAC F2 
electron scattering data (compared to theoretical model [6]). 
(b) Comparison of NMC F% /F% (corrected for nuclear ef- 
fects) and the prediction in NLO using the MRS(R2) PDF 
with and without our proposed modification to the d/u ratio. 



x. We perform a next-to-leading order (NLO) analysis on 
the precise NMC F^/Fl^ data pj] to extract d/u as a func- 
tion of x. We extract the ratio i 7 2 p+n /i 7 2 p by applying the 
nuclear binding correction i 7 ^ / F 2 n+P to the F% /F% data. 
As shown in Fig. 0(b), the standard PDF's ffl] do not 
describe the extracted F^ 71 /F2 ■ Since the u distribution 
is relatively well constrained, we find a correction term 
to d/u in the standard PDF's (as a function of x), by 
varying only the d distribution to fit the data. The cor- 
rection term is parametrized as a simple quadratic form, 
S(d/u) = (0.1 ± 0.01) (x + l)x for the MRS(R2) PDF, 
where the corrected d/u ratio is (d/u)' = (d/u) + S(d/u). 
Based on this correction, we obtain a MRS(R2)-modificd 
PDF as shown in Fig||(a). The correction to other PDF's 
such as CTEQ3M/4M is similar. Note that since the d 
quark level is small at large x, all the sum rules are eas- 
ily satisfied with a very minute change at low x. The 
NMC data, when corrected for nuclear binding effects in 
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the deuteron, clearly indicate that d/u in the standard 
PDF's is significantly underestimated at high x as shown 
in Fig. ||(a). It also shows that the modified d/u ratio 
approaches 0.2±0.02 as x — ► 1, in agreement with a QCD 
prediction p0| . In contract, if the deuteron data is only 
corrected for Fermi motion effects (as was mistakenly 
done in the past) both the d/u from data and the d/u in 
the standard PDF's fits approach as x — > 1. Figure |^(a) 
shows that d/u values extracted from CDHSW vp/ 
Tip data (which are free from nuclear effects) also favor 
the modified PDF's at high x. 

Information (which is not affected by the corrections 
for nuclear effects in the deuteron) on d/u can be also 
extracted from IT* production data in hadron colliders. 
Figure ||(b) shows that the predicted W asymmetry cal- 
culated with the DYRAD NLO QCD program using our 
modified PDF is in much better agreement with recent 
CDF data Jl2) at large rapidity than standard PDF's. 
When the modified PDF at Q 2 =16 GeV 2 is evolved to 
Q 2 =10 4 GeV 2 using the NLO QCD evolution, we find 
that the modified d distribution at x = 0.5 is increased 
by about 40 % in comparison to the standard d distri- 
bution. The modified PDF's have a significant impact 
on the charged current cross sections jl3| in the HERA 
high Q 2 region, shown in Fig. ||(a), because the charged 
current scattering with positrons is on d quark only. Fig- 
ure |](b) shows that the modified PDF's also lead to an 
increase of 10% in the production rate of very high Pt 
jets |l4|] in hadron colliders. 
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Lepton Rapidity 
2. (a) The d/u distributions at 
function of x for the standard and modified MRS(R2) PDF 
compared to the CDHSW data, (b) Comparison of the CDF 
W asymmetry data with NLO standard CTEQ3M, MRS(R2), 
and modified MRS(R2) as a function of the lepton rapidity. 
The standard CTEQ3M with a resummation calculation is 
also shown for comparison. 
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FIG. 3. (a) The HERA charged current cross section data 
and (b) the CDF and DO inclusive jet cross section data are 
compared with both standard and modified PDF's. 



Since all the standard PDF's, including our modi- 
fied versions, are fit to data with x less than 0.75, we 
now investigate the validity of the modified MRS(R2) 
at very high x by comparing to F% data at SLAC. Al- 
though the SLAC data at very high x are at reason- 
able values of Q 2 (7 < Q 2 < 31 GeV 2 ), they are in 
a region in which non-perturbative effects such as tar- 
get mass and higher twist are very large. We use the 
Georgi-Politzer calculation (h| for the target mass cor- 
rections (TM). These involve using the scaling variable 
£ = 2x/(l + v/l + 4Af 2 x 2 /Q 2 ) instead of x. Since a com- 
plete calculation of higher twist effects is not available, 
the very low Q 2 data are used to obtain information on 
the size of these terms. 

We use two approaches in our study of the higher 
twist effects: an empirical method, and the renormalon 
model. In the empirical approach, the higher twist con- 
tribution is evaluated by adding a term h(x)/Q 2 to the 
perturbative QCD (pQCD) prediction of the structure 
function (including target mass effects). The x depen- 
dence of the higher twist coefficients h[x) is fitted to 
the global deep-inelastic scattering (DIS) i*2 (SLAC, 
BCDMS, and NMC) data IJ^jJI in the kinematic region 
(0.1 < x < 0.75, 1.25 < Q 2 < 260 GeV 2 ) with the fol- 
lowing form; F 2 = F% QCD+ ™ (1 + h(x) / Q 2 ) f (x) . Here 
f{x) is a floating factor to investigate possible x depen- 
dent corrections to our modified PDF. A functional form, 
a( j^-— — c) for h(x) is used in the higher twist fit to esti- 
mate the size of the higher twist terms above x = 0.75. 
The SLAC and BCDMS data are normalized to the NMC 
data. In the case of the BCDMS data, a systematic er- 
ror shift A (in standard deviation units) is allowed to 
account for the correlated point-to-point systematic er- 



rors. The empirical higher twist fits with the modified 
NLO MRS(R2) pQCD prediction with TM have been 
performed simultaneously on the proton and deuteron 
F 2 data with 11 free parameters (two relative normal- 
izations and three parameters for h(x) per target and 
the BCDMS A). We find that empirical higher twist fit 
describes the data well (x 2 /d.o.f. = 843/805), and the 
higher twist contributions in the proton and deuteron 
are similar. The magnitude is almost half of the size 
from a previous analysis of SLAC/BCDMS data be- 
cause that analysis was based on a s (M|) = 0.113, while 
a s {M 2 ) = 0.120 in the MRS(R2) PDF, which is close to 
the current world average. 

In the renormalon model approach jgflj , the model pre- 
dicts the complete x dependence of the higher twist con- 
tributions to F 2 , 2xF\ , and xF^ , with only two unknown 
parameters A 2 and A4. We extract the A 2 and A4 pa- 
rameters, which determine the overall level of the l/Q 2 
and l/Q 4 terms by fitting to the global data set for F 2 
and R{= F 2 (l + 4Mx 2 /Q 2 )/2xFi - 1]. The values of 
A 2 and A4 for the proton and deuteron are same in this 
model. The x dependence of 2xF\ differs from that of 
F 2 but is the same as that of xF% within a power cor- 
rection of l/Q 2 . Our fits can also be used to estimate 
the size of the higher twist effects in xF$ [e.g. the Gross- 
Llewellyn Smith (GLS) sum rule]. The higher twist fit 
in this approach has employed the same procedure as 
the empirical method. Figure [| shows that the model 
yields description of the x dependence of higher twist 
terms in both F 2 and R with just the two free parameters 
( X 2 /d.o.f. = 1470/928). The CCFR neutrino data Q 
is shown for comparison though it is not used in the 
fit. The extracted values of A 2 are —0.093 ± 0.005 and 
—0.101 ± 0.005, for proton and deuteron, respectively. 
The contribution of A4 is found to be negligible. We 
find that the floating factor f(x) for the deuteron de- 
viates from 1 and is also bigger than that for the pro- 
ton, unless the modified MRS(R2) PDF is used. This 
reflects our earlier conclusion that the standard d dis- 
tribution is underestimated at high x region. As in the 
empirical fit, the extracted A 2 value is half of the previ- 
ous estimated value Q of A 2 based on SLAC/BCDMS 
[a s (M|) = 0.113] analysis p^| . Since both of these ap- 
proaches yield a reasonable description for the higher 
twist effects, we proceed to compare the predictions of 
the modified PDF's (including target mass and renor- 
malon higher twist corrections) to the SLAC proton F 2 
data at very high x (0.7 < x < 1). 



There is a wealth of SLAC data 23 in the region up to 
x = 0.98 and intermediate Q 2 (7 < Q 2 < 31 GeV 2 ). Pre- 
vious PDF fits have not used these data. We use the es- 
timate of the higher twist effects from the models, based 
on the data (below x < 0.75) described above. Note that 
the data for x > 0.75 is in the DIS region, and the data 
for x > 0.9 is the resonance region. It is worthwhile to in- 
vestigate the resonance region also because from duality 
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FIG. 4. The description of higher twist fit using the renor- 
malon model with the modified NLO MRS(R2) PDF. The 
CCFR neutrino data is also shown for comparison, (a) Com- 
parison of F2 and NLO prediction with and without higher 
twist contributions, (b) Comparison of R and NLO prediction 
with and without the renormalon higher twist contributions. 
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FIG. 5. Comparison of SLAC F% data with the predic- 
tions of the modified MRS(R2), CTEQ4M and the CTEQ 
toy model at high x and higher Q 2 (20 < Q 2 < 31 GeV 2 ). 
(a) Ratio to pQCD, (b) ratio to pQCD with TM effects, and 
(c) ratio to pQCD with TM and higher twist effects 



arguments [Q it is expected that the average behavior 
of the resonances and elastic peak should follows the DIS 
scaling limit curve. Figure [| shows the ratio of the SLAC 
data to the predictions of the modified MRS(R2) at rel- 
atively large Q 2 (21 < Q 2 < 30 GeV 2 ) where the elastic 
contribution is negligible. With the inclusion of target 
mass and the renormalon higher twist effects, the very 
high x data from SLAC is remarkably well described by 
the modified MRS(R2) up to x = 0.98. The good descrip- 
tion of the data by the modified MRS(R2) is also achieved 
using the empirical estimate [h(x)/Q 2 ] of higher twist ef- 
fects as shown in Fig. |5|(c). Figure || also shows that the 
CTEQ Toy model (with an additional 0.5% component 
of u quarks beyond x > 0.75) overestimates the SLAC 
data by a factor of three at x = 0.9 (DIS region). From 
these comparisons, we find that the SLAC F2 data do not 
support the CTEQ Toy model which proposed an addi- 
tional u quark contribution at high x as an explanation 
of the initial HERA high Q 2 anomaly and the CDF high- 
Pt jet excess. As indicated in Fig. [3](c), the uncertainties 
in the PDF's at high x are small. The difference between 
CTEQ4M and MRS(R2) (with our d/u modifications) is 
an estimate of the errors. 

In conclusion, we find that nuclear binding effects in 
the deuteron play a significant role in our understanding 
of d/u at high x. With the inclusion of target mass and 



higher twist corrections, the modified PDF's also describe 
all DIS data up to x = 0.98 and down to Q 2 = 1 GeV 2 . 
The modified PDF's with our d/u correction are in good 
agreement with the prediction of QCD at x = 1, and 
with the CDHSW up and Vp data, the HERA CC cross 
section data, the collider high-P t jet data, and with the 
CDF W asymmetry data. A next-to-next leading order 
(NNLO) analysis |^2| of R indicates that the higher twist 
effects extracted in the NLO fit at low Q 2 may originate 
from the missing NNLO terms. 
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